A Study of Intensional Concept Drift in Trending DBpedia Concepts
نویسندگان
چکیده
Concept drift refers to the phenomenon that concepts change their intensional composition, and therefore meaning, over time. It is a manifestation of content dynamics, and an important problem with regard to access and scalability in the Web of Data. Such drifts go back to contextual influences due to social embedding as suggested by e.g. topic analysis, news detection, and trends in social networks. Using DBpedia as a source of timestamped Linked Open Data, we analyze the interaction between a sample of popular keywords, as recorded by Google Trends, and their respective concept drifts in DBpedia. For the latter task, we deploy SemaDrift, an ontology evolution platform for detecting and measuring content dislocation dependent on context modification. Our hypothesis is that social embedding and awareness is an important trigger for concept drift in crowdsourced knowledge bases on the Web.
منابع مشابه
Expansion of Tail Concept Using Web Tables
Human-curated knowledgebases like Freebase and DBPedia cover popular concepts such as persons, organizations and locations, but many more specific concepts fall into the long tail outside current knowledgebases, such as acidic fruits, HD video formats and renewable resources. These concepts are found in conceptentity pairs automatically extracted from text documents, but they cover a limited nu...
متن کاملWhat Is Concept Drift and How to Measure It?
This paper studies concept drift over time. We first define the meaning of a concept in terms of intension, extension and label. We then introduce concept drift over time and two derived notions: (in)stability over a time period and concept shift between two time points. We apply our framework in three case-studies, one from communication science, on DBPedia, and one in the legal domain. We des...
متن کاملConcept drift and how to identify it
This paper studies concept drift over time. We first define the meaning of a concept in terms of intension, extension and label. Then we study concept drift over time using two theories: one based on concept identity and one based on concept morphing. A qualitative toolkit for analysing concept drift is proposed to detect concept shift and stability when concept identity is available, and conce...
متن کاملEvolutionary Conceptual Clustering Based on Induced Pseudo-Metrics
We present a method based on clustering techniques to detect possible/probable novel concepts or concept drift in a Description Logics knowledge base. The method exploits a semi-distance measure defined for individuals, that is based on a finite number of dimensions corresponding to a committee of discriminating features (concept descriptions). A maximally discriminating group of features is ob...
متن کاملDetecting Concept Drift in Data Stream Using Semi-Supervised Classification
Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...
متن کامل